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Abstract: Three-stage t-tests of separated one-sided hypotheses are derived, 
extending Lorden's optimal three-stage tests for the one-dimensional expo- 
nential family by using Lai and Zhang's generalization of Schwarz's optimal 
fully-sequential tests to the multiparameter exponential family. The resulting 
three-stage t-tests are shown to be asymptotically optimal, achieving the same 
average sample size as optimal fully-sequential tests. 



1. Introduction 

Lorden ^ derived asymptotically optimal three-stage tests of separated one-sided 
hypotheses about the parameter of a one-dimensional exponential family. Lorden's 
tests use the initial stage to estimate the unknown parameter and then use this to 
choose the size of the second stage close to the average sample size of Schwarz's 
optimal sequential tests. Lorden showed that the resulting three-stage tests are as 
asymptotically efficient as optimal fully-sequential tests, asymptotically achieving 
Hoeffding's [l| lower bound on the average sample size of a sequential test with 
given error probabilities. Lorden showed that, conversely, three stages were also 
necessary for asymptotic efficiency, except in a few degenerate cases. Moreover, Lor- 
den showed that any efficient three-stage test must mimic Schwarz's sequential 
test in this way. 

Lai [2!] modified Schwarz's 0] tests of separated one-sided hypotheses to allow the 
distance between the hypotheses to approach zero, and hence derived asymptoti- 
cally optimal sequential tests of hypotheses with or without indifference regions. Lai 
and Zhang [ij extended Lai's 0] tests to general hypotheses in the multiparameter 
exponential family setting by developing and applying certain results on boundary 
crossing probabilities of multiparameter generalized likelihood ratio statistics. 

In this paper we present three-stage t-tests of one-sided, separated hypotheses 
about the mean of i.i.d. normal data whose variance is unknown. On one hand, 
this is an extension of the tests of Lorden Q to the two-parameter setting. On the 
other hand, this is a first step in adapting the multiparameter sequential tests of 
Lai and Zhang [3] to the multistage setting. Theorem 12.21 shows that the expected 
sample size of our three-stage t-teat asymptotically achieves the Hoeffding [l| lower 
bound as the error probabilities approach zero. Converse results and extensions are 
discussed in Section [31 



2. Three-stage t-tests 

Let Xi, X2, ... be i.i.d. N{fi, cr^), where both fj, and are unknown. We consider 
testing 
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It is assumed that 



Ho ■■ fJ-< fJ.0 vs. Hi : fi> fio- 



< (T^ < o-^ < CT^ 



In practice, such bounds may be indicated by prior experience with the type of 
data or they may be imphed by practical considerations hke desired maximum and 
minimum total sample sizes. For example, if the Xi represent the difference between 
paired treatment and control responses in a clinical trial, previous experience with 
similar trials may suggest values of Alternatively, it may be more natural 

for practitioners to work in terms of minimum and maximum sample size, which 

and imply, as will be seen below. 

Let L„(u, u^) denote the log- likelihood function 



i„(w,v2) ^-(n/2)log^.2-(2^.2)" 



^=l 



iX, - uf 



The log of the generalized likelihood ratio (GLR) is, for i = 0, 1, 
Ai_„ = supi„(M, u^) - supL„(^i,ti^) 



(n/2) log 



1 



(2.1) 

(2.2) =nh{X,,,dl), 
where Xn is the sample mean, 

n ^ — ^ 



Xn — 



Xr. 



and 



/,(u,«2) = (i/2)log[l + (u-/.,)'M 

is the KuUback-Leibler information number. The term being squared inside the log 
in (|2.ip is of course a multiple of the usual t statistic, but it will be more natural 
in what follows to work with Aj „ rather than the t statistic. Given Aq, Ai > 0, a, 
GLR test of Hq vs. Hi rejects Hq if 



(2.3) 

and rejects Hi if 
(2.4) 



Xn > Mo and Ao^„ > Ao, 



Xn < Ml and Ai,„ > Ai. 



The boundaries (|2.3p and (|2.4p define "stopping surfaces" in (n, A^, ^"(A^ 
A„)^)-space. The surface (|2.3|) intersects the line 



(2.5) 



1 



A,; 



un, 



E 

1 



(A, 



Xn?^v^n 



at a point whose n-coordinate is 
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where we let no(/io,w^) = oo since I^ilJ^Q^v^) = 0. Define ni(u,w^) — w^) 
similarly with respect to the surface (|2.4p . and let 



n{u, V ) — min nilu, v ). 

i=0,l 

Let fj,2 — M2(^0; ^i) G (/Lto, ^i) be the unique solution of 

Ai " /i(M2,a2)' 

which exists because the right hand side increases from to +oo as /12 ranges from 
fiQ to /ii, and let n = n(/i2,5'^)- Since ni{u,v'^) < ni(u,'a'^) for all u and < o"^, 

(2.6) n = n{ii2,'3'^) — snpn{u,a^) ~ sup n{u,v'^). 

Hence n is roughly the "worst-case" expected total sample size required to cross 
one of the boundaries (|2.3p or (12. 4p when the variance is no greater than a^. 

Asymptotic optimality results will be proved for bounded values of /i G [/^jM] 
where 

< ^0 < Ml < M- 

As with the bounds on cr^, in practice these values may be suggested by prior 
knowledge or by the constraints on maximum and minimum sample size that they 
imply. Let 

J= [^t,7l] X [a^,a^], 
Je = - e,JI + s) X (ct^ - e, ct^ + e) 

for e > 0. 

Before defining the stopping rule of our three stage i-test we state an auxiliary 
lemma that gives bounds on how close the estimated total sample size after sampling 
k, n{Xk,d'l), is to the "correct" total sample size n{^,a'^). This is an extension of 
Lemma 1 of Lorden [5] to the two parameter setting. 

Lemma 2.1. Let < e < g_^. There is a positive constant B such that if 



p„ = 1 + B\/n ilog 



n, 



then 



(2.7) 



P^,,2 ( {Xk,dl) e J,,p^' < ^^^^^ < PkJor all k > ko 



> l-0(l/fco) 
as fco 00, uniformly for (/i, a^) G J. 

Proof. Using large deviations probabilities (e.g., Lai & Zhang Lemma 1) it can 
be shown that there is a > such that 

(2.8) Pi_i,a^{\'Xk - Ai| > ?7 some fc > fco) < exp(-/3?7^fco) 

(2.9) Pf,^^2{\al -<J^\>r] some k > kg) < exp(-/3r;2fc^) 
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as fco — > 00, uniformly in (p, cr^) £ J^. By a straightforward Taylor series argument 
it can be shown that there is Z? < oo such that 



(2.10) 



n{u, u^) 



< D{\u - mI + k'^ - cr^D 



for all {u,v^) e Je, (MiO"^) ^ 

Let y be the event in (|2.7p . Denote P^.o-a simply by P. First write 



(2.11) 



n(^,cr2) 



> , some k > Kq 



+ P{{Xk,dl) i J£,some fc > /cq)- 

The latter is no greater than 

P(|Xfc - [i\> £,some fc > fco) + P(|CTfc - cr^l > e, some fc > fco) < 2 exp(-2/3'fcn) 

for some > by (|2.8p and (|2.9p . Since this is obviously o(l/fco), the proof of the 
lemma will be complete once we show that the former term in (I2.11[) is O(l/fco). 
Using p.lOp . this term is no greater than the sum over all fc > fco of 



P Ui 



Pfc - 1 
2D 



<P Xfe-^ > 



Pfc - 1 
4P> 



4P> 



By 



and (|2.9p . for some /3" > this is no greater than 



2exph/3"(pfc - l)2fc2] = J2 2exp(-/3"Plogfc) = ^ 2/fc2 = O(l/fco), 

k>kf) k>ko k>ko 



by choosing B — 2/P" 



□ 



We can now define the stopping times Ni,N2,N3 of the three stage i-test. Let 
m = [n] , Aq, A\ > 0, and choose < C < 1 and < e < . Let 

Ni= [C[n{ri,a^)An{fi,a^)]\ 

7V2 = m A (TVi V \p%^n(XN, , ^n, )] ) 
TVs = m. 

The test stops at the end of stage i < 3 and rejects ffo if 

(2.12) CT^ - e < (T^^ < CT^ + e, ^o < -''^w. < + e, and Ao,jv, > ^o- 

The test stops at the end of stage i < 3 and rejects Hi if 



(2.13) 



e < a%. < (T^ + e, ^ - e < X^^ < fii, and Ai jy, > ^i- 



Note that when the total sample size is m and af^ < ct^, one of (|2.12p . (|2.13p is 
guaranteed to happen by our choice of n in (12. 6p . In the event that the total sample 
size is m but neither (|2.12p nor (|2.13p happens, we define the test to reject Hq if 
and only if Ao,™ > Ai^^. 
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Theorem 2.2. Let N ~ N{A^^Ai) he the total sample size of the test defined 
above, and let 

(2.14) a, = P^^ ^ {Reject H,) = sup [Reject H,) {i = 0,1). 

Let C{ao, ai) be the class of all (possibly sequential) tests of Hq vs. Hi with error 
probabilities {2.1^^ no greater than aQ,ai. As Aq^Ai — > oo such that Aq ^ Ai, 

E^^^2N < n{fi, a^) + 0{^A,\ogA,) 

(2.15) = inf E^,^^2N' + 0{^A, log A,) 

N'eC{aa, ai) 

uniformly for (fi, cr^) G J, for either i = 0, 1. 

Proof. To bound E^ ,j2N from above we follow the three-stage argument of Lorden 
[il with Lemma [23] Let V be the event in (|2.7p with fco = -^^i- We will show that 

(2.16) N <\pl^n{XN^,dl,)] onV. 
It is true that Ni < n{^, cr^) since in fact 

n{fi,(T'^) > inf n{n,a'^) 

(2.17) =infn(u,CT2) 

u 

= n{fj,,a^) A n{ji, a^) 
> C-^Ni > Ni, 

where (I2.17[) holds since no{-,a^) is non-decreasing and ni{-,a^) is non-increasing. 
If the test stops after the first stage, 

N = Ni <n(ji,cr^) < PnM.^ n^^'^nJ < \ PN^n(X n,,'^%J] 

on V. It N ~ N2 ~ m, then by definition of A''2, 

N = m< \p%^n(XN,,o-%J] 

on V. Otherwise, N2 < m and so, on V, 

N2=\p%^n(XN^,CT%J] 

> pN^n{p,,(T'^) 

> {PnJ PN2)n{XN.,,&lj^) > n(XAr2,(T^J. 

This, along with {X n2^'^'n.2) ^ '-^^^ implies that two stages suffice for stopping in 
this case. Hence 

^ = ^2 < \pl^n(XN,,al^)-\, 

establishing (|2.16p . 

Conditioning on V and using Lemma l2.1l gives 

(2.18) <n{^l,a'') + 0{m{pN,-l)) + 0{l) 

= n{p, cr^) + 0(v/A, log A,), 
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Since 



m{pN, - 1) = mBJN^^ logiVi 



-o(A,v/(logA:)M,) 



= O (V A: log A,) . 



Consider the fully-sequential test with rejection rule (|2.12l) and (|2.13p with Ni 
replaced by n, A^i < n < m. Lai Q established that the error probabilities of 
this test are 0{e~^^Af). Since the error probabilities of this fully sequential test 
obviously bound the error probabilities of our three-stage test, we have 



Using this estimate with Hoeffding's |]| lower bound in the two-parameter expo- 
nential family setting gives 



Combining this with our bound for E^ ,y2N in (|2.18p gives the desired result. □ 
3. Converse results and extensions 

It is natural to suspect that the converse results obtained by Lorden hold in 
this two-parameter setting. Namely, that essentially no two-stage test can achieve 
the asymptotic efficiency (j2.15p and that any efficient three-stage test must have 
a total sample size close to n(/i, cr^) after it's second stage. Indeed, it is easily 
shown that Theorem 2 and Corollary 2 of Lorden [Bj extend immediately to this 
setting. However, the i-test may be deceptively simple since the hypotheses in 
question are one-dimensional. It is not yet clear whether these properties of optimal 
tests will carry over to the general multiparameter setting, especially when the 
dimensions of the hypotheses exceed one or there is no type of "separation" between 
the hypotheses, as we had here. A first step toward answering these questions would 
be to adapt the asymptotically optimal fully-sequential tests of Lai & Zhang [4] to 
the general multistage setting, which remains to be done. 
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